Blog

  • Advances in diagnosis and management of systemic light chain amyloidosis

    Advances in diagnosis and management of systemic light chain amyloidosis

    Systemic light chain (AL) amyloidosis is a rare and life-threatening disorder characterized by the deposition of misfolded immunoglobulin light chains as insoluble amyloid fibrils in various tissues and organs, leading to…

    Continue Reading

  • Just a moment…

    Just a moment…

    Continue Reading

  • New chemotherapeutic agent shows remarkable efficacy against multiple tumor types

    New chemotherapeutic agent shows remarkable efficacy against multiple tumor types

    A research team led by the Medical University of Vienna, the HUN-REN Research Centre for Natural Sciences and the Eötvös Loránd University in Budapest has developed a groundbreaking new chemotherapeutic agent, LiPyDau, which…

    Continue Reading

  • Seth Meyers on Trump’s South Korea visit: ‘Getting the royal treatment he so desperately craves’ | Late-night TV roundup

    Seth Meyers on Trump’s South Korea visit: ‘Getting the royal treatment he so desperately craves’ | Late-night TV roundup

    Late-night hosts recapped Donald Trump’s lavish visit to South Korea, where he received a ceremonial golden crown.

    Seth Meyers

    Trump continued his tour of Asia on Wednesday, where he’s been “getting the royal treatment he so desperately…

    Continue Reading

  • Free Play Days – Call of Duty Black Ops 6 (MP & Zombies), Dragon Ball Xenoverse 2, Dead by Daylight, CATAN, and More – Xbox Wire

    1. Free Play Days – Call of Duty Black Ops 6 (MP & Zombies), Dragon Ball Xenoverse 2, Dead by Daylight, CATAN, and More  Xbox Wire
    2. Activision Announces Call of Duty: Black Ops 6 Multiplayer and Zombies Go Free Again Today, Just as EA Launches…

    Continue Reading

  • Xbox shoots for the moon with Snapchat activation amid pricing furor

    Xbox shoots for the moon with Snapchat activation amid pricing furor

    This audio is auto-generated. Please let us know if you have

    Continue Reading

  • Comet Lemmon will be a Halloween treat – Astronomy Magazine

    1. Comet Lemmon will be a Halloween treat  Astronomy Magazine
    2. ‘Miracle’ photo captures Comet Lemmon and meteor seemingly entwined over Earth  Live Science
    3. Comet Tracker For Wednesday: See Comets For Final Time As Moon Grows  Forbes
    4. Starwatchers capture…

    Continue Reading

  • The Strad news – Hilary Hahn cancels three Berlin Philharmonic dates in December

    The Strad news – Hilary Hahn cancels three Berlin Philharmonic dates in December

    Read more news stories here 

    Hilary Hahn has had to cancel further concerts in December as she continues to recover from her ongoing double pinched nerve injury. The violinist was due to play Dvořák’s Violin Concerto in A minor with the…

    Continue Reading

  • Global Virus Network announces the addition of three new Centers of Excellence

    Global Virus Network announces the addition of three new Centers of Excellence

    The Global Virus Network (GVN), a coalition of leading medical virologists representing 80+ Centers of Excellence and Affiliates in 40+ countries, today announced the addition of three new Centers of Excellence: the USF Health…

    Continue Reading

  • OpenSpliceAI provides an efficient modular implementation of SpliceAI enabling easy retraining across nonhuman species

    OpenSpliceAI provides an efficient modular implementation of SpliceAI enabling easy retraining across nonhuman species

    We developed OpenSpliceAI to be a modular Python toolkit designed as an open-source implementation of SpliceAI, to which we added several key enhancements. The framework replicates the core logic of the SpliceAI model while optimizing prediction efficiency and variant effect analysis, such as acceptor and donor gains or losses, using pre-trained models. Our benchmarks show substantial computational advantages over SpliceAI, with faster processing, lower memory usage, and improved GPU efficiency (Figure 2B, Figure 2—figure supplement 6). These improvements are driven by our optimized PyTorch implementation that employs dynamic computation graphs and on-demand GPU memory allocation – allowing memory to be allocated and freed as needed – in contrast to SpliceAI’s static, Keras-based TensorFlow approach, which pre-allocates memory for the worst-case input size. In SpliceAI, this rigid memory allocation leads to high memory overhead and frequent out-of-memory errors when handling large datasets through large loop iteration prediction. Additionally, OpenSpliceAI leverages streamlined data handling and enhanced parallelization through batch prediction and multiprocessing, automatically distributing tasks across available threads. Together, these features prevent the memory pitfalls common in SpliceAI and make OpenSpliceAI a more scalable and efficient solution for large-scale genomic analysis.

    It is important to note that even though OpenSpliceAI and SpliceAI share the same model architecture, the released trained models are not identical. The variability observed between our models and the original SpliceAI – and even among successive training runs using the same code and data – can be attributed to several sources of inherent randomness. First, weight initialization is performed randomly for many layers, which means that different initial weights can lead to distinct convergence paths and final model parameters. Second, the process of data shuffling alters the composition of mini-batches during training, impacting both the training dynamics and the statistics computed in batch normalization layers. Although batch normalization is deterministic for a fixed mini-batch, its reliance on batch statistics introduces variability due to the random sampling of data. Finally, OpenSpliceAI employs the AdamW optimizer (Loshchilov and Hutter, 2019), which incorporates exponential moving averages of the first and second moments of the gradients. This mechanism serves a momentum-like role, contributing to an adaptive learning process that is inherently stochastic. Moreover, subtle differences in the order of operations or floating-point arithmetic, particularly in distributed computing environments, can further amplify this stochastic behavior. Together, these factors contribute to the observed nondeterministic behavior, resulting in slight discrepancies between our trained models and the original SpliceAI, as well as among successive training sessions under identical conditions.

    OpenSpliceAI empowers researchers to adapt the framework to many other species by including modules that enable easy retraining. For closely related species such as mice, our retrained model demonstrated comparable or slightly better precision than the human-based SpliceAI model. For more distant species such as A. thaliana, whose genomic structure differs substantially from humans, retraining OpenSpliceAI yields much greater improvements in accuracy. Our initial release includes models trained on the human MANE genome annotation and four additional species: mouse, zebrafish, honeybee, and A. thaliana. We also evaluated pre-training on mouse (OSAIMouse), honeybee (OSAIHoneybee), zebrafish (OSAIZebrafish), and Arabidopsis (OSAIArabidopsis) followed by fine-tuning on the human MANE dataset. While cross-species pre-training substantially accelerated convergence during fine-tuning, the final human splicing prediction accuracy was comparable to that of a model trained from scratch on human data. This result indicates that our architecture seems to capture all relevant splicing features from human training data alone and thus gains little or no benefit from cross-species transfer learning in this context (see Figure 4—figure supplement 5).

    OpenSpliceAI also includes modules for transfer learning, allowing researchers to initialize models with weights learned on other species. In our transfer learning experiments, models transferred from human to other species displayed faster convergence and higher stability, with potential for increased accuracy. We also incorporate model calibration via temperature scaling, providing better alignment between predicted probabilities and empirical distributions.

    The ISM study revealed that OSAIMANE and SpliceAI made predictions using very similar sets of motifs (Figure 6B). Across several experiments, we note that SpliceAI exhibits an inherent bias near the starts and ends of transcripts which are padded with flanking Ns (as was done in the original study), predicting donor and acceptor sites in these boundaries with an extremely high signal that disappears when the sequence is padded with the actual genomic sequence. For example, the model correctly predicted the first donor site of the CFTR gene when the gene’s boundaries were flanked with N’s; however, when replaced those N’s with the actual DNA sequence upstream of the gene boundary, the signal all but disappeared, as seen in Figure 6D. This suggests a bias resulting from the way the model is trained. In our ISM benchmarks, we thus chose not to use flanking N’s unless explicitly recreating a study from the original SpliceAI paper.

    Additionally, we note that both the SpliceAI and OSAIMANE ‘models’ are the averaged result of five individual models, each initialized with slightly different weights. During the prediction process, each individual model was found to have discernibly different performance. By averaging their outputs leveraging the deep-ensemble approach (Fort et al., 2019; Lakshminarayanan et al., 2017), the overall performance of both SpliceAI and OpenSpliceAI improved while reducing sensitivity to local variations. In essence, this method normalizes the inherent randomness of the individual models, resulting in predictions that are more robust and better represent the expected behavior, ultimately yielding improved average performance across large datasets. OpenSpliceAI’s ‘predict’ submodule averages across all five models by default, but it also supports prediction using a single model.

    In summary, OpenSpliceAI is a fully open-source, accessible, and computationally efficient deep learning system for splice site prediction. Its modular architecture, enhanced performance, and adaptability to diverse species make it a powerful tool for advancing research on gene regulation and splicing across diverse species.

    Continue Reading